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Probability Topics: Introduction 
This module introduces the concept of Probability, the chance of an event 
occurring. 


Student Learning Objectives 
By the end of this chapter, the student should be able to: 


e Understand and use the terminology of probability. 

e Determine whether two events are mutually exclusive and whether two 
events are independent. 

e Construct and interpret Contingency Tables. 


Introduction 


It is often necessary to "guess" about the outcome of an event in order to 
make a decision. Politicians study polls to guess their likelihood of winning 
an election. Teachers choose a particular course of study based on what they 
think students can comprehend. Doctors choose the treatments needed for 
various diseases based on their assessment of likely results. You may have 
visited a casino where people play games chosen because of the belief that 
the likelihood of winning is good. You may have chosen your course of 
study based on the probable availability of jobs. 


You have, more than likely, used probability. In fact, you probably have an 
intuitive sense of probability. Probability deals with the chance of an event 
occurring. Whenever you weigh the odds of whether or not to do your 
homework or to study for an exam, you are using probability. In this 
chapter, you will learn to solve probability problems using a systematic 
approach. 


Optional Collaborative Classroom Exercise 


Your instructor will survey your class. Count the number of students in the 
class today. 


e Raise your hand if you have any change in your pocket or purse. 
Record the number of raised hands. 

e Raise your hand if you rode a bus within the past month. Record the 
number of raised hands. 

e Raise your hand if you answered "yes" to BOTH of the first two 
questions. Record the number of raised hands. 


Use the class data as estimates of the following probabilities. 

means the probability that a randomly chosen person in your class has 
change in his/her pocket or purse. means the probability that a 
randomly chosen person in your class rode a bus within the last month and 
so on. Discuss your answers. 


e Find 

e Find 

e Find Find the probability that a randomly chosen 
student in your class has change in his/her pocket or purse and rode a 
bus within the last month. 

e Find Find the probability that a randomly chosen 
student has change given that he/she rode a bus within the last month. 
Count all the students that rode a bus. From the group of students who 
rode a bus, count those who have change. The probability is equal to 
those who have change and rode a bus divided by those who rode a 
bus. 


Probability Topics: Terminology 

Probability: Terminology is part of the collection col10555 written by 
Barbara Illowsky and Susan Dean defines key terms related to Probability 
and has contributions from Roberta Bloom. 


Probability is a measure that is associated with how certain we are of 
outcomes of a particular experiment or activity. An experiment is a 
planned operation carried out under controlled conditions. If the result is 
not predetermined, then the experiment is said to be a chance experiment. 
Flipping one fair coin twice is an example of an experiment. 


The result of an experiment is called an outcome. A sample space is a set 
of all possible outcomes. Three ways to represent a sample space are to list 
the possible outcomes, to create a tree diagram, or to create a Venn diagram. 
The uppercase letter S' is used to denote the sample space. For example, if 
you flip one fair coin, S = {H, T} where H = heads and T = tails are the 
outcomes. 


An event is any combination of outcomes. Upper case letters like A and B 
represent events. For example, if the experiment is to flip one fair coin, 
event A might be getting at most one head. The probability of an event A is 
written P(A). 


The probability of any outcome is the long-term relative frequency of 
that outcome. Probabilities are between 0 and 1, inclusive (includes 0 and 
1 and all numbers between these values). P(A) = 0 means the event A can 
never happen. P(A) = 1 means the event A always happens. P(A) = 0.5 
means the event A is equally likely to occur or not to occur. For example, if 
you flip one fair coin repeatedly (from 20 to 2,000 to 20,000 times) the 
relative fequency of heads approaches 0.5 (the probability of heads). 


Equally likely means that each outcome of an experiment occurs with 
equal probability. For example, if you toss a fair, six-sided die, each face 
(1, 2, 3, 4, 5, or 6) is as likely to occur as any other face. If you toss a fair 
coin, a Head(H) and a Tail(T) are equally likely to occur. If you randomly 
guess the answer to a true/false question on an exam, you are equally likely 
to select a correct answer or an incorrect answer. 


To calculate the probability of an event A when all outcomes in the 
sample space are equally likely, count the number of outcomes for event 
A and divide by the total number of outcomes in the sample space. For 
example, if you toss a fair dime and a fair nickel, the sample space is 
{HH, TH, HT, TT} where T = tails and H = heads. The sample space 
has four outcomes. A = getting one head. There are two outcomes 

{HT, TH}. P(A) =2. 
Suppose you roll one fair six-sided die, with the numbers {1,2,3,4,5,6} on 
its faces. Let event & = rolling a number that is at least 5. There are two 
outcomes {5, 6}. P(E) =2. If you were to roll the die only a few times, 


you would not be surprised if your observed results did not match the 
probability. If you were to roll the die a very large number of times, you 
would expect that, overall, 2/6 of the rolls would result in an outcome of "at 
least 5". You would not expect exactly 2/6. The long-term relative 
frequency of obtaining this result would approach the theoretical probability 
of 2/6 as the number of repetitions grows larger and larger. 


This important characteristic of probability experiments is the known as the 
Law of Large Numbers: as the number of repetitions of an experiment is 
increased, the relative frequency obtained in the experiment tends to 
become closer and closer to the theoretical probability. Even though the 
outcomes don't happen according to any set pattern or order, overall, the 
long-term observed relative frequency will approach the theoretical 
probability. (The word empirical is often used instead of the word 
observed.) The Law of Large Numbers will be discussed again in Chapter 7. 


It is important to realize that in many situations, the outcomes are not 
equally likely. A coin or die may be unfair, or biased . Two math 
professors in Europe had their statistics students test the Belgian 1 Euro 
coin and discovered that in 250 trials, a head was obtained 56% of the time 
and a tail was obtained 44% of the time. The data seem to show that the 
coin is not a fair coin; more repetitions would be helpful to draw a more 
accurate conclusion about such bias. Some dice may be biased. Look at the 
dice in a game you have at home; the spots on each face are usually small 
holes carved out and then painted to make the spots visible. Your dice may 
or may not be biased; it is possible that the outcomes may be affected by the 


slight weight differences due to the different numbers of holes in the faces. 
Gambling casinos have a lot of money depending on outcomes from rolling 
dice, so casino dice are made differently to eliminate bias. Casino dice have 
flat faces; the holes are completely filled with paint having the same density 
as the material that the dice are made out of so that each face is equally 
likely to occur. Later in this chapter we will learn techniques to use to work 
with probabilities for events that are not equally likely. 


"OR" Event: 

An outcome is in the event A OR B if the outcome is in A or is in B or is 
in both A and B. For example, let A = {1, 2, 3, 4, 5} and 

B= {4, 5,6, 7,8}. A OR B = (1, 2, 3, 4, 5, 6, 7, 8}. Notice that 4 
and 5 are NOT listed twice. 


"AND" Event: 

An outcome is in the event A AND B if the outcome is in both A and B at 
the same time. For example, let A and B be {1, 2, 3, 4, 5} and 

{4, 5, 6, 7, 8}, respectively. Then A AND B = {4, 5}. 


The complement of event A is denoted A’ (read "A prime"). A’ consists of 
all outcomes that are NOT in A. Notice that P(A) + P(A’) = 1. For 
example, let S = {1, 2, 3, 4, 5, 6} and let A = {1, 2, 3, 4}. Then, 

A’ = {5, 6}. P(A) =4, P(A’) =2, and P(A) + P(A’) =4+2=1 
The conditional probability of A given B is written P(A|B). P(A|B) is 
the probability that event A will occur given that the event B has already 
occurred. A conditional reduces the sample space. We calculate the 
probability of A from the reduced sample space B. The formula to calculate 
P(A|B) is 


P(A|B)= SST a B) 


where P(B) is greater than 0. 
For example, suppose we toss one fair, six-sided die. The sample space 


S = {1, 2, 3, 4, 5, 6}. Let A = face is 2 or 3 and B = face is even (2, 4, 6). 
To calculate P(A|B), we count the number of outcomes 2 or 3 in the 


sample space B = {2, 4, 6}. Then we divide that by the number of 
outcomes in B (and not S). 


We get the same result by using the formula. Remember that S has 6 
outcomes. 


P(A|B) = 
P(A andB) __ (the number of outcomes that are 2 or 3 andeveninS)/6 1/6 1 
P(B) (the number of outcomes that are even in S) / 6 ~ 3/6 3 


Understanding Terminology and Symbols 

It is important to read each problem carefully to think about and understand 
what the events are. Understanding the wording is the first very important 
step in solving probability problems. Reread the problem several times if 
necessary. Clearly identify the event of interest. Determine whether there is 
a condition stated in the wording that would indicate that the probability is 
conditional; carefully identify the condition, if any. 

Exercise: 


Problem: 


In a particular college class, there are male and female students. Some 
students have long hair and some students have short hair. Write the 
symbols for the probabilities of the events for parts (a) through (j) 
below. (Note that you can't find numerical answers here. You were not 
given enough information to find any probability values yet; 
concentrate on understanding the symbols.) 


e Let F be the event that a student is female. 

e Let M be the event that a student is male. 

e Let S be the event that a student has short hair. 
e Let L be the event that a student has long hair. 


e a The probability that a student does not have long hair. 

¢ b The probability that a student is male or has short hair. 

¢ c The probability that a student is a female and has long hair. 

e d The probability that a student is male, given that the student has 
long hair. 


e e The probability that a student has long hair, given that the 
student is male. 

e f Of all the female students, the probability that a student has 
short hair. 

e g Of all students with long hair, the probability that a student is 
female. 

e h The probability that a student is female or has long hair. 

e i The probability that a randomly selected student is a male 
student with short hair. 

e j The probability that a student is female. 


Solution: 


e a P(L')=P(S) 
b P(M or S) 
c P(F and L) 
d P(M|L) 

e P(L|M) 

f P(S|F) 

g P(F|L) 

h P(F or L) 
e i P(M andS) 
PCP) 


**With contributions from Roberta Bloom 


Glossary 


Conditional Probability 
The likelihood that an event will occur given that another event has 
already occurred. 


Equally Likely 
Each outcome of an experiment has the same probability. 


Experiment 
A planned activity carried out under controlled conditions. 


Event 
A subset in the set of all outcomes of an experiment. The set of all 
outcomes of an experiment is called a sample space and denoted 
usually by S. An event is any arbitrary subset in S. It can contain one 
outcome, two outcomes, no outcomes (empty subset), the entire 
sample space, etc. Standard notations for events are capital letters such 
as A, B, CG, etc. 


Outcome (observation) 
A particular result of an experiment. 


Probability 
A number between 0 and 1, inclusive, that gives the likelihood that a 
specific event will occur. The foundation of statistics is given by the 
following 3 axioms (by A. N. Kolmogorov, 1930’s): Let S denote the 
sample space and A and B are two events in S'. Then: 


* 0 < P(A) <1; 

e If A and B are any two mutually exclusive events, then 
P(Aor B) = P(A) + P(B). 

* P(S)=1. 


Sample Space 
The set of all possible outcomes of an experiment. 


Probability Topics: Independent & Mutually Exclusive Events 

Probability: Independent and Mutually Exclusive Events is part of the 
collection col10555 written by Barbara Illowsky and Susan Dean and 
explains the concept of independent events, where the probability of event 
A does not have any effect on the probability of event B, and mutually 
exclusive events, where events A and B cannot occur at the same time. The 
module has contributions from Roberta Bloom. 


Independent and mutually exclusive do not mean the same thing. 


Independent Events 


Two events are independent if the following are true: 


» P(A|B) = P(A) 


Two events A and B are independent if the knowledge that one occurred 
does not affect the chance the other occurs. For example, the outcomes of 
two roles of a fair die are independent events. The outcome of the first roll 
does not change the probability for the outcome of the second roll. To show 
two events are independent, you must show only one of the above 
conditions. If two events are NOT independent, then we say that they are 
dependent. 


Sampling may be done with replacement or without replacement. 


¢ With replacement: If each member of a population is replaced after it 
is picked, then that member has the possibility of being chosen more 
than once. When sampling is done with replacement, then events are 
considered to be independent, meaning the result of the first pick will 
not change the probabilities for the second pick. 

e Without replacement:: When sampling is done without replacement, 
then each member of a population may be chosen only once. In this 
case, the probabilities for the second pick are affected by the result of 


the first pick. The events are considered to be dependent or not 
independent. 


If it is not known whether A and B are independent or dependent, assume 
they are dependent until you can show otherwise. 


Mutually Exclusive Events 


A and B are mutually exclusive events if they cannot occur at the same 
time. This means that A and B do not share any outcomes and 
P(A AND B) = 0. 


For example, suppose the sample space S = {1, 2, 3, 4, 5, 6, 7, 8, 9, 10}. 
Let A = {1, 2,3, 4,5}, B = {4, 5, 6, 7, 8}, and C = {7, 9}. 

A AND B = {4,5}. P(A AND B) = and is not equal to zero. 
Therefore, A and B are not mutually exclusive. A and C do not have any 
numbers in common so P(A AND C) = 0. Therefore, A and Care 
mutually exclusive. 


If it is not known whether A and B are mutually exclusive, assume they 
are not until you can show otherwise. 


The following examples illustrate these definitions and terms. 


Example: 

Flip two fair coins. (This is an experiment.) 

The sample space is {HH, HT, TH, TT} where T = tails and H = heads. 
The outcomes are HH, HT, TH, and TT. The outcomes HT and TH are 
different. The HT means that the first coin showed heads and the second 
coin showed tails. The ‘TH means that the first coin showed tails and the 
second coin showed heads. 


e Let A = the event of getting at most one tail. (At most one tail means 
0 or 1 tail.) Then A can be written as {HH, HT, TH}. The outcome 
HH shows 0 tails. HT and TH each show 1 tail. 


¢ Let B= the event of getting all tails. B can be written as {TT}. B is 
the complement of A. So, B = A’. Also, 
P(A) + P(B) = P(A) + P(A’) = 1. 

e The probabilities for A and for B are P(A) = + and P(B) = i. 

e Let C = the event of getting all heads. C = {HH}. Since B = {TT}, 
P(B AND C) = 0. Band C are mutually exclusive. (B and C have 
no members in common because you cannot have all tails and all 
heads at the same time.) 

e Let D = event of getting more than one tail. D = {TT}. P(D) = + 

¢ Let & = event of getting a head on the first roll. (This implies you can 
get either a head or tail on the second roll.) FE = {HT, HH}. 
Ph). 

e Find the probability of getting at least one (1 or 2) tail in two flips. 
Let F' = event of getting at least one tail in two flips. 


Faleeen (EUR AUS (Absit 20a) 


Example: 

Roll one fair 6-sided die. The sample space is {1, 2, 3, 4, 5, 6}. Let event 
A =a face is odd. Then A = {1, 3, 5}. Let event B =a face is even. Then 
B = {2, 4, 6}. 


e Find the complement of A, A’. The complement of A, A’, is B 
because A and B together make up the sample space. 
P(A) + P(B) = P(A) + P(A’) = 1. Also, P(A) = 3 and 
P(B) = ¢ 

e Let event C = odd faces larger than 2. Then C' = {3, 5}. Let event D 
= all even faces smaller than 5. Then D = {2,4}. P(C and D) = 0 
because you cannot have an odd and even face at the same time. 
Therefore, C' and D are mutually exclusive events. 

e Let event F = all faces less than 5. H = {1, 2, 3, 4}. 
Exercise: 


Problem: 


Are Cand E mutually exclusive events? (Answer yes or no.) 
Why or why not? 


Solution: 


No. C = {3, 5} and E = {1, 2, 3, 4}. P(C AND E) = <. Tobe 
mutually exclusive, P(C AND E) must be 0. 


e Find P(C|A). This is a conditional. Recall that the event C’ is {3, 5} 
and event A is {1, 3, 5}. To find P(C|A), find the probability of C 
using the sample space A. You have reduced the sample space from 
the original sample space {1, 2, 3, 4, 5, 6} to {1, 3, 5}. So, 

P(C|A) = 4 


Example: 

Let event G = taking a math class. Let event H = taking a science class. 
Then, G AND H = taking a math class and a science class. Suppose 
P(G) = 0.6, P(H) = 0.5, and P(G AND H) = 0.3. Are G and H 
independent? 

If G and A are independent, then you must show ONE of the following: 


» P(G|H) = P(G) 


Note:The choice you make depends on the information you have. You 
could choose any of the methods here because you have the necessary 
information. 


Exercise: 


Problem: Show that P(G|H) = P(G). 


Solution: 


P(G AND H 
P(G|H) = “Sa = 38 = 0.6 = PG) 


Exercise: 
Problem: Show P(G AND H) = P(G)- P(H). 


Solution: 


P(G)-P(H) = 0.6-0.5 = 0.3 = P(G AND H) 


Since G and H are independent, then, knowing that a person is taking a 
science class does not change the chance that he/she is taking math. If the 
two events had not been independent (that is, they are dependent) then 
knowing that a person is taking a science class would change the chance 
he/she is taking math. For practice, show that P(H|G) = P(H) to show 
that G and H are independent events. 


Example: 

In a box there are 3 red cards and 5 blue cards. The red cards are marked 
with the numbers 1, 2, and 3, and the blue cards are marked with the 
numbers 1, 2, 3, 4, and 5. The cards are well-shuffled. You reach into the 
box (you cannot see into it) and draw one card. 

Let R = red card is drawn, B = blue card is drawn, & = even-numbered 
card is drawn. 

The sample space S = R1, R2, R3, B1, B2, B3, B4, B5. S has 8 
outcomes. 


e2B Ch 3. P(B) = 2. P(R AND B) = 0. (You cannot draw one 
card that is both red and blue.) 


e P(E) = 3. (There are 3 even-numbered cards, R2, B2, and B4.) 

¢ P(E|B) = 2. (There are 5 blue cards: B1, B2, B3, B4, and Bd. Out 
of the blue cards, there are 2 even cards: B2 and B4.) 

¢ P(B|E) = &. (There are 3 even-numbered cards: R2, B2, and B4. 
Out of the even-numbered cards, 2 are blue: B2 and B4.) 

e The events R and B are mutually exclusive because 
P{R.AND B) = 0: 

e Let G = card with a number greater than 3. G = {B4, B5}. 
B(G) = 2. Let = blue card numbered between 1 and 4, inclusive. 
H = {B1, B2, B3, B4}. P(G|H) = 4. (The only card in H that has 
a number greater than 3 is B4.) Since = +, P(G) = P(G|H) 
which means that G and H are independent. 


Example: 

In a particular college class, 60% of the students are female. 50 % of all 
students in the class have long hair. 45% of the students are female and 
have long hair. Of the female students, 75% have long hair. Let F be the 
event that the student is female. Let L be the event that the student has long 
hair. One student is picked randomly. Are the events of being female and 
having long hair independent? 


e The following probabilities are given in this example: 
eR (R= 060 P(e) 40- 50 

e P(F AND L) = 0.45 

e P(L|F) = 0.75 


Note:The choice you make depends on the information you have. You 
could use the first or last condition on the list for this example. You do not 
know P(F|L) yet, so you can not use the second condition. 


Solution 1 

Check whether P(F and L) = P(F)P(L): We are given that P(F and L) = 0.45 
; but P(F)P(L) = (0.60)(0.50)= 0.30 The events of being female and having 
long hair are not independent because P(F and L) does not equal P(F)P(L). 
Solution 2 

check whether P(L|F) equals P(L): We are given that P(L|F) = 0.75 but 
P(L) = 0.50; they are not equal. The events of being female and having 
long hair are not independent. 

Interpretation of Results 

The events of being female and having long hair are not independent; 
knowing that a student is female changes the probability that a student has 
long hair. 


**Example 5 contributed by Roberta Bloom 


Glossary 


Independent Events 
The occurrence of one event has no effect on the probability of the 
occurrence of any other event. Events A and B are independent if one 
of the following is true: (1). P(A|B) = P(A); (2) P(B|A) = P(B); 
(3) P(Aand B) = P(A)P(B). 


Mutually Exclusive 
An observation cannot fall into more than one class (category). Being 
in more than one category prevents being in a mutually exclusive 
category. 


Probability Topics: Contingency Tables 
This module introduces the contingency table as a way of determining conditional 
probabilities. 


A contingency table provides a way of portraying data that can facilitate calculating 
probabilities. The table helps in determining conditional probabilities quite easily. The 
table displays sample values in relation to two different variables that may be dependent 
or contingent on one another. Later on, we will use contingency tables again, but in 
another manner. Contingincy tables provide a way of portraying data that can facilitate 
calculating probabilities. 


Example: 
Suppose a study of speeding violations and drivers who use car phones produced the 
following fictional data: 


Speeding violation in No speeding violation in 

the last year the last year Total 
Cappel 25 280 305 
user 
een ces 45 405 450 
phone user 
Total 70 685 755 


The total number of people in the sample is 755. The row totals are 305 and 450. The 
column totals are 70 and 685. Notice that 305 + 450 = 755 and 70 + 685 = 755. 
Calculate the following probabilities using the table 

Exercise: 


Problem: P(person is a car phone user) = 


Solution: 


number of car phone users __ 305 
total number in study — 755 


Exercise: 


Problem: P(person had no violation in the last year) = 


Solution: 


number that had no violation __ 685 
total number in study 55 


Exercise: 


Problem: 
P(person had no violation in the last year AND was a car phone user) = 


Solution: 


280 
795 


Exercise: 


Problem: 


P(person is a car phone user OR person had no violation in the last year) = 


Solution: 

(355 +755) “755 = 55 
Exercise: 

Problem: 


P(person is a car phone user GIVEN person had a violation in the last year) = 
Solution: 


2. (The sample space is reduced to the number of persons who had a violation.) 
Exercise: 


Problem: 
P(person had no violation last year GIVEN person was not a car phone user) = 


Solution: 


== (The sample space is reduced to the number of persons who were not car phone 


users. ) 


Example: 
The following table shows a random sample of 100 hikers and the areas of hiking 
preferred: 


The Near Lakes and On Mountain 
Sex Coastline Streams Peaks Total 
Female 18 16 ae: 45 
Male — — 14 55 
Total = 41 


Hiking Area Preference 


Exercise: 


Problem: Complete the table. 


Solution: 
The Near Lakes and On Mountain 
Sex Coastline Streams Peaks Total 
Female 18 16 11 45 
Male 16 25 14 55 
Total 34 41 25 100 


Hiking Area Preference 


Exercise: 


Problem: 
Are the events "being female" and "preferring the coastline" independent events? 
Let F' = being female and let C’' = preferring the coastline. 


* aP(F AND C) = 
¢ bP(F) - P(C) = 


Are these two numbers the same? If they are, then F and C’ are independent. If they 
are not, then F’ and C are not independent. 


Solution: 
¢ aP(F AND C) = == =0.18 
¢ bP(F)-P(C) = > - 4a = 0.45 -0.34 = 0.153 


P(F AND C) # P(F) - P(C), so the events F' and C are not independent. 
Exercise: 

Problem: 

Find the probability that a person is male given that the person prefers hiking near 


lakes and streams. Let M = being male and let L = prefers hiking near lakes and 
streams. 


e¢ aWhat word tells you this is a conditional? 
¢ bFill in the blanks and calculate the probability: P(__|__) = 
e cls the sample space for this problem all 100 hikers? If not, what is it? 


Solution: 
e aThe word 'given' tells you that this is a conditional. 
e bP(M|L) = 2 
e cNo, the sample space for this problem is 41. 


Exercise: 


Problem: 


Find the probability that a person is female or prefers hiking on mountain peaks. 
Let F = being female and let P = prefers mountain peaks. 


aP(F) = 
3 DE CE = 
¢ cP(F AND P) = 
e dTherefore, P(F OR P) = 


Solution: 
_ 45 
e aP(F) = Ry 
¢ bP(P) = Fae 
e cP(F AND P) = a 
° dP(FORP)= 3 +4-#=2 
Example: 


Muddy Mouse lives in a cage with 3 doors. If Muddy goes out the first door, the 
probability that he gets caught by Alissa the cat is ~ and the probability he is not caught 
is 2. If he goes out the second door, the probability he gets caught by Alissa is - and 
the probability he is not caught is 3. The probability that Alissa catches Muddy coming 
out of the third door is - and the probability she does not catch Muddy is S$. It is 


equally likely that Muddy will choose any of the three doors so the probability of 
choosing each door is = 


Caught or Not Door One Door Two Door Three Total 
1 1 i 

Caught TH Tol ‘ei 

Not Caught “= — 4 


Caught or Not Door One Door Two Door Three 


Total 


Door Choice 


° The first entry == = (+) (4) is P(Door One AND Caught). 
¢ The entry = = () SS is P(Door One AND Not Caught). 


Verify the remaining entries. 
Exercise: 


Problem: 


Total 


Complete the probability contingency table. Calculate the entries for the totals. 


Verify that the lower-right corner entry is 1. 


Solution: 


Caught or Not Door One Door Two Door Three 


1 1 il 
Cau ght 15 TD ras 
Not Caught = a 4 
Total aa ~ = 


Door Choice 


Exercise: 


Problem: What is the probability that Alissa does not catch Muddy? 


Solution: 


AL 
60 


Exercise: 


Total 


19 
60 


AL 
60 


1 


Problem: 


What is the probability that Muddy chooses Door One OR Door Two given that 
Muddy is caught by Alissa? 


Solution: 


=e 
19 


Note: You could also do this problem by using a probability tree. See the Tree Diagrams 
(Optional) section of this chapter for examples. 


Glossary 


Contingency Table 
The method of displaying a frequency distribution as a table with rows and columns 
to show how two variables may be dependent (contingent) upon each other. The 
table provides an easy way to calculate conditional probabilities. 


Probability Topics: Summary of Formulas 

This module provides a review of the probability formulas, including the 
definitions of independent, complementary, and mutually exclusive events 
as well as the addition and multiplication rules. 

Formula 

Compliment 


If Aand are complements then 
Formula 
Mutually Exclusive 


If A and B are mutually exclusive then ; SO 


If A and B are NOT mutually exclusive then 


Formula 
Independence 


If A and B are independent then: 


Probability Topics: Practice with Contingency Tables 

This module provides the opportunity for students to apply what they've learned about probability to 
solve a series of problems given a set of data. Students will practice constructing and interpreting 
contingency tables. 


Student Learning Outcomes 


e The student will construct and interpret contingency tables. 


Given 

An article in the New England Journal of Medicine , reported about a study of smokers in California 
and Hawaii. In one part of the report, the self-reported ethnicity and smoking levels per day were 
given. Of the people smoking at most 10 cigarettes per day, there were 9886 African Americans, 2745 
Native Hawaiians, 12,831 Latinos, 8378 Japanese Americans, and 7650 Whites. Of the people 
smoking 11-20 cigarettes per day, there were 6514 African Americans, 3062 Native Hawaiians, 4932 
Latinos, 10,680 Japanese Americans, and 9877 Whites. Of the people smoking 21-30 cigarettes per 
day, there were 1671 African Americans, 1419 Native Hawaiians, 1406 Latinos, 4715 Japanese 
Americans, and 6062 Whites. Of the people smoking at least 31 cigarettes per day, there were 759 


African Americans, 788 Native Hawaiians, 800 Latinos, 2305 Japanese Americans, and 3970 Whites. 
((Source: http://www.nejm.org/doi/full/10.1056/NEJMoa033250)) 


Complete the Table 


Complete the table below using the data provided. 


Smoking African Native Japanese 
Level American Hawaiian Latino Americans White TOTALS 


1-10 
11-20 
21-30 
31+ 
TOTALS 


Smoking Levels by Ethnicity 


Analyze the Data 


Suppose that one person from the study is randomly selected. 


Exercise: 


Problem: Find the probability that person smoked 11-20 cigarettes per day. 


Solution: 


35,065 
100,450 


Exercise: 


Problem: Find the probability that person was Latino. 


Solution: 


19,969 
100,450 


Discussion Questions 


Exercise: 


Problem: 


In words, explain what it means to pick one person from the study and that person is “Japanese 
American AND smokes 21-30 cigarettes per day.” Also, find the probability. 
Solution: 


4,715 
100,450 


Exercise: 


Problem: 


In words, explain what it means to pick one person from the study and that person is “Japanese 
American OR smokes 21-30 cigarettes per day.” Also, find the probability. 


Solution: 


36,636 
100,450 


Exercise: 


Problem: 


In words, explain what it means to pick one person from the study and that person is “Japanese 
American GIVEN that person smokes 21-30 cigarettes per day.” Also, find the probability. 


Solution: 


4715 
15,273 


Exercise: 


Problem: Prove that smoking level/day and ethnicity are dependent events. 


Probability Topics: Homework 
This module provides a number of homework exercises related to Probability. 
Exercise: 


Problem: 


Suppose that you have 8 cards. 5 are green and 3 are yellow. The 5 green cards are 
numbered 1, 2, 3, 4, and 5. The 3 yellow cards are numbered 1, 2, and 3. The cards are well 
shuffled. You randomly draw one card. 


¢ G=card drawn is green 
e F/=card drawn is even-numbered 


e aList the sample space. 

* bP(G) = 

¢ cP(G|E) = 

e dP(G AND E) = 

e eP(G ORE) = 

e fAre G and F mutually exclusive? Justify your answer numerically. 


Solution: 
¢ a{Gl1, G2, G3, G4, G5, Y1, Y2, Y3} 
e b2 
°c 
ed 
e 


agora 7l 


e f{No 


Exercise: 


Problem: 


Refer to the previous problem. Suppose that this time you randomly draw two cards, one at a 
time, and with replacement. 


¢ G = first card is green 
¢ G2= second card is green 


e aDraw a tree diagram of the situation. 

b P(G; AND G3) = 

¢ c P(at least one green) = 

d P(G | Gi) = 

¢ eAre G2 and G; independent events? Explain why or why not. 


Exercise: 


Problem: 


Refer to the previous problems. Suppose that this time you randomly draw two cards, one at 
a time, and without replacement. 


G = first card is green 
G 9= second card is green 


aDraw a tree diagram of the situation. 

cP(at least one green) = 

dP(G2|G1) = 

eAre G2 and G, independent events? Explain why or why not. 


Solution: 
- b(3)(4) 
© (3)(F) +(s)(F) + (B)4) 
od 
e eNo 
Exercise: 


Problem: Roll two fair dice. Each die has 6 faces. 


aList the sample space. 

bLet A be the event that either a 3 or 4 is rolled first, followed by an even number. Find 
P(A). 

cLet B be the event that the sum of the two rolls is at most 7. Find P(B). 

din words, explain what “P(A|B)” represents. Find P(A|B). 

eAre A and B mutually exclusive events? Explain your answer in 1 - 3 complete 
sentences, including numerical justification. 

fAre A and B independent events? Explain your answer in 1 - 3 complete sentences, 
including numerical justification. 


Exercise: 


Problem: 


A special deck of cards has 10 cards. Four are green, three are blue, and three are red. When 
a card is picked, the color of it is recorded. An experiment consists of first picking a card 
and then tossing a coin. 


aList the sample space. 
bLet A be the event that a blue card is picked first, followed by landing a head on the 
coin toss. Find P(A). 


e cLet B be the event that a red or green is picked, followed by landing a head on the 
coin toss. Are the events A and B mutually exclusive? Explain your answer in 1 - 3 
complete sentences, including numerical justification. 

e dLet C be the event that a red or blue is picked, followed by landing a head on the coin 
toss. Are the events A and C’ mutually exclusive? Explain your answer in 1 - 3 
complete sentences, including numerical justification. 


Solution: 


¢ a{GH,GT,BH,BT,RH,RT} 


Exercise: 


Problem: An experiment consists of first rolling a die and then tossing a coin: 


e aList the sample space. 

¢ bLet A be the event that either a 3 or 4 is rolled first, followed by landing a head on the 
coin toss. Find P(A). 

¢ cLet B be the event that a number less than 2 is rolled, followed by landing a head on 
the coin toss. Are the events A and B mutually exclusive? Explain your answer in 1 - 3 
complete sentences, including numerical justification. 


Exercise: 
Problem: 


An experiment consists of tossing a nickel, a dime and a quarter. Of interest is the side the 
coin lands on. 


e aList the sample space. 

¢ bLet A be the event that there are at least two tails. Find P(A). 

¢ cLet B be the event that the first and second tosses land on heads. Are the events A and 
B mutually exclusive? Explain your answer in 1 - 3 complete sentences, including 
justification. 


Solution: 


¢ a {(HHH),(HHT),(HTH),(HTT) ,(THH),(THT),(TTH),(TTT)} 
e bz 
e cYes 


Exercise: 


Problem: Consider the following scenario: 


¢ Let P(C) = 0.4 
* Let P(D) = 0.5 
* Let P(C|D) = 0.6 


aFind P(C AND D). 

bAre C’ and D mutually exclusive? Why or why not? 
cAre C and D independent events? Why or why not? 
dFind P(C ANDD). 

eFind P(D|C). 


Exercise: 


Problem: £ and F mutually exclusive events. P(E) = 0.4; P(F) = 0.5. Find P(E | F). 


Solution: 


0 


Exercise: 


Problem: J and K are independent events. P(J | K) = 0.3. Find P(J) . 


Exercise: 


Problem: U and V are mutually exclusive events. P(U) = 0.26; P(V) = 0.37. Find: 


« aP(U AND V) = 
* bP(U|V)= 
* cP(U OR V) = 


Solution: 


e al 
e bO 
e c0.63 


Exercise: 


Problem: 


Q and R are independent events. P(Q) = 0.4; P(Q AND R) = 0.1. Find P(R). 


Exercise: 


Problem: Y and Z are independent events. 


¢ a Rewrite the basic Addition Rule P(Y OR Z) = P(Y) + P(Z) — P(Y AND Z) 
using the information that Y and Z are independent events. 
¢ b Use the rewritten rule to find P(Z) if P(Y OR Z) = 0.71 and P(Y) = 0.42. 


Solution: 


e b0.5 
Exercise: 


Problem: G and H are mutually exclusive events. P(G) = 0.5; P(H) = 0.3 


¢ aExplain why the following statement MUST be false: P(H | G) = 0.4. 
¢ bFind: P(H OR G). 
e cAre G and H independent or dependent events? Explain in a complete sentence. 


Exercise: 
Problem: 
The following are real data from Santa Clara County, CA. As of March 31, 2000, there was 


a total of 3059 documented cases of AIDS in the county. They were grouped into the 
following categories (Source: Santa Clara County Public H.D.): 


IV 
Drug Heterosexual 
Homosexual/Bisexual User* Contact Other Totals 
Female 0 70 136 49 
Male 2146 463 60 135 


Totals a 
* includes homosexual/bisexual IV drug users 


Suppose one of the persons with AIDS in Santa Clara County is randomly selected. 
Compute the following: 


¢ a P(person is female) = 


¢ bP(person has a risk factor Heterosexual Contact) = 


¢ cP(person is female OR has a risk factor of IV Drug User) = 
e dP(person is female AND has a risk factor of Homosexual/Bisexual) = 
¢ eP(person is male AND has a risk factor of IV Drug User) = 
¢ fP(female GIVEN person got the disease from heterosexual contact) = 


Solution: 


The completed contingency table is as follows: 


IV 
Drug 
Homosexual/Bisexual User* 
Female 0 70 
Male 2146 463 
Totals 2146 533 


* includes homosexual/bisexual IV drug users 


Exercise: 


Problem: 


Heterosexual 
Contact 


136 


60 


196 


Other 


49 


135 


174 


Totals 


255 


2804 


3059 


A previous year, the weights of the members of the San Francisco 49ers and the Dallas 
Cowboys were published in the San Jose Mercury News. The factual data are compiled into 


the following table. 


Shirt# < 210 211-250 251-290 290< 


1-33 21 S) 0 0 
34-66 6 18 7 4 
66-99 6 12 22 5 


For the following, suppose that you randomly select one player from the 49ers or Cowboys. 


e aFind the probability that his shirt number is from 1 to 33. 

e bFind the probability that he weighs at most 210 pounds. 

e cFind the probability that his shirt number is from 1 to 33 AND he weighs at most 210 
pounds. 

e dFind the probability that his shirt number is from 1 to 33 OR he weighs at most 210 
pounds. 

e eFind the probability that his shirt number is from 1 to 33 GIVEN that he weighs at 
most 210 pounds. 

e flf having a shirt number from 1 to 33 and weighing at most 210 pounds were 
independent events, then what should be true about P(Shirt# 1-33 | < 210 pounds)? 


Exercise: 


Problem: 


Approximately 249,000,000 people live in the United States. Of these people, 31,800,000 
speak a language other than English at home. Of those who speak another language at home, 
over 50 percent speak Spanish. (Source: U.S. Bureau of the Census, 1990 Census) 


Let: & = speak English at home; E’ = speak another language at home; S = speak Spanish at 
home 


Finish each probability statement by matching the correct answer. 


Probability Statements Answers 
a. P(E’) = i. 0.8723 
b. P(E) = ii. > 0.50 


c. P(S) = iii. 0.1277 


Probability Statements Answers 


d. P(S|E') = iv. > 0.0639 


Solution: 


e aiii 
e bi 
e civ 


e dii 


The next two questions refer to the following: The percent of licensed U.S. drivers (from a 
recent year) that are female is 48.60. Of the females, 5.03% are age 19 and under; 81.36% are age 
20 - 64; 13.61% are age 65 or over. Of the licensed U.S. male drivers, 5.04% are age 19 and 
under; 81.43% are age 20 - 64; 13.53% are age 65 or over. (Source: Federal Highway 
Administration, U.S. Dept. of Transportation) 


Try these multiple choice questions. 


The next three questions refer to the following table of data obtained from www.baseball- 
almanac.com showing hit information for 4 well known baseball players. Suppose that one hit 
from the table is randomly selected. 


NAME Single Double Triple Home Run TOTAL HITS 

Babe Ruth 1517 506 136 714 2873 

Jackie Robinson 1054 273 54 137 1518 

Ty Cobb 3603 174 295 114 4189 

Hank Aaron 2294 624 98 755 3771 

TOTAL 8471 1577 583 1720 12351 
Exercise: 


Problem: Find P(hit was made by Babe Ruth). 


1518 
2873 
2873 
12351 
583 
12351 
4189 
12351 


5SOD pS 


Solution: 


B 


Exercise: 


Problem: Find P(hit was made by Ty Cobb | The hit was a Home Run) 


4189 
pe 
° Bino 
. Cc 120 


a8 
° D a351 


Solution: 


B 
Exercise: 
Problem: 


Are the hit being made by Hank Aaron and the hit being a double independent 
events? 


¢ A Yes, because P(hit by Hank Aaron | hit is a double) = P(hit by Hank Aaron) 
¢ BNo, because P(hit by Hank Aaron | hit is a double) 4 P(hit is a double) 
e C No, because 
P(hit is by Hank Aaron | hit is a double) ¢ P(hit by Hank Aaron) 
¢ D Yes, because P(hit is by Hank Aaron | hit is a double) = P(hit is a double) 


Solution: 


S 


Probability Topics: Review 
This module provides a number of homework/review exercises related to 
Probability. 


The first six exercises refer to the following study: In a survey of 100 
stocks on NASDAQ, the average percent increase for the past year was 9% 
for NASDAQ stocks. Answer the following: 

Exercise: 


Problem: The “average increase” for all NASDAQ stocks is the: 
e APopulation 
e BStatistic 
e CParameter 
¢ DSample 
e EVariable 


Solution: 


e C Parameter 
Exercise: 


Problem: All of the NASDAQ stocks are the: 


e APopulation 
e BStatistic 
e CParameter 
¢ DSample 
e EVariable 


Solution: 


e A Population 


Exercise: 


Problem: 9% is the: 
e APopulation 
e BStatistic 
e CParameter 
¢ DSample 
e EVariable 


Solution: 


e B Statistic 
Exercise: 


Problem: The 100 NASDAQ stocks in the survey are the: 
e APopulation 
e BStatistic 
e CParameter 
¢ DSample 
e EVariable 


Solution: 


¢ D Sample 
Exercise: 


Problem: The percent increase for one stock in the survey is the: 


e APopulation 
e BStatistic 


e CParameter 
¢ DSample 
e EVariable 
Solution: 
e E Variable 
Exercise: 


Problem: 


Would the data collected be qualitative, quantitative — discrete, or 
quantitative — continuous? 


Solution: 


quantitative - continuous 


The next two questions refer to the following study: Thirty people spent 
two weeks around Mardi Gras in New Orleans. Their two-week weight gain 
is below. (Note: a loss is shown by a negative weight gain.) 


Weight Gain Frequency 
-2 3 
-1 is) 


0 2 


Weight Gain Frequency 


1 4 

4 13 

6 2 

11 1 
Exercise: 


Problem: Calculate the following values: 
e aThe average weight gain for the two weeks 
e bThe standard deviation 
e cThe first, second, and third quartiles 
Solution: 
e a2.27 


° b3.04 
° c-1,4,4 


Exercise: 


Problem: Construct a histogram and a boxplot of the data. 


